
Fix for stateful execution bug in llama-bench #58

Merged
cavusmustafa merged 5 commits into ravi9:dev_backend_openvino from cavusmustafa:fix_stateful_state_sync
Mar 10, 2026

Conversation

@cavusmustafa
Collaborator

Stateful execution was failing because of KV cache buffer sync issues when llama-bench switches from prefill to decode. This PR fixes the sync and also makes "ov_runtime_context" a singleton, since the backend context seems to be reinitialized in llama-bench after prefill and before decode.
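
To illustrate why a singleton helps here, the following is a minimal sketch, not the code from this PR. It assumes a runtime context that must outlive individual backend contexts, so that state recorded during prefill is still visible when a fresh backend context is created for decode. `RuntimeContext`, `BackendContext`, `kv_tokens`, and `simulate_prefill_then_decode` are all hypothetical names chosen for the example; the real `ov_runtime_context` may hold different state and use a different API.

```cpp
#include <cassert>
#include <cstddef>
#include <map>
#include <mutex>
#include <string>

// Hypothetical stand-in for the runtime context (assumption, not the real type).
struct RuntimeContext {
    std::mutex mtx;
    // Hypothetical shared state: number of tokens cached per model key.
    std::map<std::string, std::size_t> kv_tokens;

    // Function-local static: constructed once on first use; initialization
    // is thread-safe since C++11.
    static RuntimeContext & instance() {
        static RuntimeContext ctx;
        return ctx;
    }

    RuntimeContext(const RuntimeContext &) = delete;
    RuntimeContext & operator=(const RuntimeContext &) = delete;

private:
    RuntimeContext() = default;
};

// Hypothetical backend context. It may be destroyed and recreated between
// prefill and decode, but it always refers to the same RuntimeContext.
struct BackendContext {
    RuntimeContext & runtime = RuntimeContext::instance();

    // Record that n_tokens more tokens are cached for the given key.
    void process(const std::string & key, std::size_t n_tokens) {
        std::lock_guard<std::mutex> lock(runtime.mtx);
        runtime.kv_tokens[key] += n_tokens;
    }

    // Return the number of tokens cached for the key (0 if unknown).
    std::size_t cached_tokens(const std::string & key) {
        std::lock_guard<std::mutex> lock(runtime.mtx);
        auto it = runtime.kv_tokens.find(key);
        return it == runtime.kv_tokens.end() ? 0 : it->second;
    }
};

// Simulates the prefill-then-decode pattern with a backend context that is
// reinitialized in between.
std::size_t simulate_prefill_then_decode(const std::string & key) {
    {
        BackendContext prefill;      // first backend context
        prefill.process(key, 512);   // prefill with 512 tokens
    }                                // backend context destroyed here
    BackendContext decode;           // reinitialized backend context
    decode.process(key, 1);          // decode one token
    return decode.cached_tokens(key);
}
```

Because the shared state lives in the singleton rather than in `BackendContext`, the second backend context in `simulate_prefill_then_decode` still sees the 512 tokens recorded during prefill. If the state were a plain member of `BackendContext`, it would be lost when the first context is destroyed.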

@cavusmustafa cavusmustafa requested a review from wine99 as a code owner March 7, 2026 00:18
@github-actions github-actions bot added the ggml label Mar 7, 2026
@cavusmustafa cavusmustafa merged commit 409cc8e into ravi9:dev_backend_openvino Mar 10, 2026
62 of 80 checks passed
